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Abstract 

The Benes network has been used as a rearrangeable network for over 40 years, yet the uniform N{2\ogN — 1) 
control complexity of the NxN Benes is not optimal for many permutations. In this paper, we present a novel 0(log A*') 
depth rearrangeable network called KR-Benes that is permutation-specific control -optimal. The KR-Benes routes every 
permutation with the minimal control complexity specific to that permutation and its worst-case complexity for arbitrary 
permutations is bounded by the Benes; thus it replaces the Benes when considering control complexity/latency. We 
design the KR-Benes by first constructing a restricted 2 log K + 2 depth rearrangeable network called A'-Benes for 
routing Tf-bounded permutations with control 2N log K ,Q < K < N/A. We then show that the NxN Benes network 
itself (with one additional stage) contains every A-Benes network as a subgraph and use this property to construct the 
KR-Benes network. With regard to the control-optimality of the KR-Benes, we show that any optimal network for 
rearrangeably routing if -bounded permutations must have depth 2 log 7^-1-2, and therefore the A-Benes (and hence 
the KR-Benes) is optimal. 
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I. Introduction 

The main result of this paper can be summarized as "The Benes network structure is more powerful than imagined". 
Simply put, in this paper we present the interesting result that by making a few small modifications to the Benes 
network [1], the resulting rearrangeable network can route most permutations with much lower latency and control 
overhead. The worst-case control complexity of the new rearrangeable network for arbitrary permutations is in fact 
bounded by the Benes. 

Rearrangeable networks such as the Benes contain edge-disjoint paths from inputs to outputs for all A^! possible 
permutations. They find widespread use in shared-memory multiprocessor systems ( [2], [3]), telecommunication 
networks, time division multiple accessed (TDMA) systems for satellite communication [4] and newer applications 
such as switching fabrics in internet routers. Many packet switches based on combined input-output queing models 
[5], require a permutation network in the middle of the switch fabric for routing packets from input queues to output 
ports during each time slot. High-speed optical switching networks which require crosstalk-free routing also rely on 
rearrangeable networks [8,9]. 



Given their widespread use, improving the control/hardware complexity of rearrangeable networks has tremendous 
impUcations. This therefore raises the question: How optimal is the Benes network in this regard? The Benes network 
is one of the oldest and best-known rearrangeable networks, in use for around 40 years. Waksman showed in [10] that 
the Benes is the smallest (in terms of depth) 2x2 switching element based rearrangeable network' with a depth^ of 
2 log N — 1. Routing in the Benes can be accomplished using the looping algorithm [12], which decomposes a given 
permutation into two subpermutations that can be routed independently in the subnetworks. The computational time 
complexity of the looping algorithm (i.e control complexity of the Benes) is A'^(2 \ogN — l). 

There is a vast body of literature examining the rearrangeabiUty of the Benes and other isomorphic MIN based 
networks and developing new routing algorithms for these networks. Lee [11] proved the rearrangeability of omega- 
omega"' networks by describing passabiUty conditions for permutations. The rearrangeabiUty of synmietric MINs 
was studied by Yeh and Feng [14] and Kim et al [13]. These networks have the same hardware and control complexity 
as the Benes (i.e 0(log N) depth and 0{N log N) control complexity). Networks with much larger depths have also 
been designed as permutation networks. For example, Koppelman and Oruc [15] describe a self-routing permutation 
network with 0(log^ N) depth and 0{N log^ N) switching elements. This is improved to 0{N log^ N) switching 
elements in [16] and 0(log^ N) depth in [17]. Unlike the Benes which is composed only of simple 2x2 switching 
elements these networks have a factor of 0(log N) more depth and switching elements and require sequential binary 
adders and comparators [15,16] and hyperconcentrators [17] which can merge large sequences in parallel using high 
fan-in logic gates. [18] describes self-routing permutation networks based on De-Bruijn Graphs. 

Several results on parallel control of the Benes have also been derived. Nassimi and Sahni [7] describe an 0(log^ N) 
algorithm using 0{N) processors. In [19], Lee and Zheng describe a fast parallel routing algorithm for Benes based 
networks called Benes group connectors with K active inputs in 0{log'^K + log N) time on a completely connected 
computer or the EREW PRAM model with N processors . In [9], Yang et al. prove that an N x N Benes can route 
crosstalk free permutations (i.e no switching elements has more than one active input at a given time) in two passes. 
Their control algorithm requires spUtting the original permutation into non-blocking crosstalk free permutations and 
can potentially be parallelized. However the control complexity of the Benes for each permutation is still 0{N log N). 

In an attempt to reduce the control complexity of the Benes, Feng and Seo [20,21] proposed a new routing algorithm 
called inside-out routing. They adopted a new approach to routing by starting from the center stages and moving 
outwards. The control complexity of the Benes network using this algorithm is claimed to be 0{N). However, Kim, 
Yoon, and Maeng [22] have refuted the claim made in [21]. They show that the inside-out routing algorithm requires 
backtracking due to input-output assignment conflicts and its complexity is no longer 0{N). They also show that even 
with backtracking, the modified inside-out routing algorithm may not be able to find conflict-free assignments for all 
permutations. Thus to the best of our knowledge, there are no existing 0(log A'') depth networks using 2x2 switching 
elements in the Uterature (such as the Benes) that improve its control complexity. 

Our results in this paper are motivated by the fact that the hardware and control complexity of the Benes is uniform 

^Note that other well-known permutation networks such as the Clos and Crossbar [2] have fewer stages than the Benes but they are not based 
on 2 X 2 switching elements. Being non-blocking, these are obviously more powerful than the Benes, but are correspondingly more complex to 
implement (0{N'^ ) crosspoints, as opposed to 0( AT log AT) for the Benes). 

■^AU logarithms are to base 2. 



for routing all permutations. For example, both identity as well as inverse permutations are routed in a similar manner 
through the Benes, though they are very different in structure. Thus our objective is to design a permutation- specific 
control-optimal 0(log N) depth rearrangeable network i.e one that optimally routes every permutation with minimal 
control complexity specific to that permutation. We show in this paper that such a network can be designed simply 
by considering bounded permutations. We describe the rearrangeable KR-Benes network whose control complexity is 
superior to the Benes on average and whose worst case control complexity is bounded by the uniform A'^(2 log N —1) 
control complexity of the Benes. We design the KR-Benes by first constructing a restricted network called K-Benes 
for rearrangeably routing /C-bounded permutations, which are defined as permutations satisfying |7r(i)— i| < fc, for all 
inputs i, with K the smallest power of 2 integer > k. We show that any rearrangeable network for routing ii'-bounded 
permutations must have depth at least 2\ogK + 2,0 < K < N/2 (depth at least 2 log -|- 1 for = N/2, N). The 
ii'-Benes satisfies these constraints and is therefore optimal. 

We show that every ii'-Benes is contained as a subgraph man N x N Benes network with one additional stage. 
(Not all rearrangeable networks for ii'-bounded permutations have this subgraph property, as we discuss in Section 
3). Based on this, we use -ff -Benes networks as building blocks in constructing the permutation- specific control- 
optimal KR-Benes ^. In one implementation, the KR-Benes contains 3 log N—3 stages (of which only min(2 log K + 
2, 2 log A?^ — 1) stages are used for routing a given permutation). Alternate implementations of the KR-Benes with 
2 log N colunms of switching elements using multiplexors can also be derived. The control algorithm for the KR- 
Benes is a simple modification of the Benes looping algorithm and its complexity is 2N log K which is bounded by 
the N[2 log N — 1) complexity of the Benes. 

The paper is organized as follows: Section 2 contains brief terminology and background on rearrangeable networks. 
In Section 3 we describe the K-Benes architecture and control algorithm for routing ii'-bounded permutations. Section 
4 discuses lower bounds on the depth and control complexity of a -fC-bounded rearrangeable network and the optimality 
of the iV'-Benes. Section 5 describes the KR-Benes and Section 6 concludes the paper. 

II. Background 

An N X N switching network denotes a network for interconnecting N inputs and N outputs. A switching network 
capable of handling all possible permutations on N is called a permutation network. Permutation requests from 
traffic inputs arise in many cases, for example in circuit switched networks for teleconnmunications, combined input- 
output queueing based packet switches and routers, and cross-talk free optical networks. A permutation network is 
rearrangeable if for any permutation n, we can construct edge-disjoint paths in the network Unking the i*^ input to 
the n{iY^ output for < i < N — 1. The Benes network is an example of a rearrangeable network. 

There are several ways to describe the architecture of the Benes. We describe it in terms of Butterfly and Inverse 

Butterfly networks. An NxN Butterfly [2] consists of log N colunms (or stages) of 2 x 2 switching elements arranged 

in the recursive structure shown in Figure 1. An Inverse Butterfly network is the mirror image of a Butterfly. 

*Note that as a stand-alone network, the K-Benes is useful when incoming permutations exhibit static boundedness (i.e., the maximum value of 
K is bounded and does not increase with time). For such scenarios, a X-Benes network can be used instead of the standard NxN Benes with 
concommitant hardware and control complexity benefits. However, in most realistic cases, the value of K among permutation requests wiU change 
dynamically over time. Therefore the real utility of the Jf-Benes lies in its use as a building block for the KR-Benes network. 
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(I) Recursive structure of the Butterfly Network (ii) An 8 x 8 Butterfly Network. 

An NxN Benes network consists ofanNxN Butterfly foUowed by an Inverse Butterfly. Thus the Benes contains 
2 log N stages of switching elements. However the last stage of the Butterfly network can be merged with the first 
stage of the Inverse Butterfly to decrease the total number of stages to 2 log N—1. Figure 2 iUustrates an 8 x 8 Benes. 
Note that the recursive structure of the Butterfly and Inverse Butterfly networks automatically leads to a recursive 
decomposition of the Benes network. An NxN Benes can be viewed as consisting of two outer stages of switching 
elements connected to top and bottom ^ ^ T Benes subnetworks. This is very useful in deriving the looping algorithm 
[2] for routing permutations in the network. 

4x4 Benes 




4x4 Benes 
Fig. 2 

Network architecture and control algorithm for an 8 x 8 Benes. 



Figure 2 also shows an example of the looping algorithm. It begins by setting a switching element in the outer left 
stage such that companion inputs are directed to top and bottom subnetworks. The destination switching elements in 
the outer right stage must automatically be set to receive these inputs from the correct subnetworks. By alternately 
setting switches in the outer stages, the entire permutation can be divided into two smaller permutations to be reaUzed 
at each Benes subnetwork. The looping algorithm sequentially examines the inputs in each stage and hence the control 
complexity of the Benes is 0{N log N). The paraUel version of the looping algorithm has complexity 0(log^ N) using 
0{N) processors [7]. 



III. K-Benes: N X N Rearrangeable Network for /^-Bounded Permutations, < K < N/4 

A if-bounded permutation n is one which satisfies the condition: |7r(z)— i| < k, for all inputs i,0 < i < N—1, kis 
any integer in [0 . . .N — 1] and K < N is the smallest power of 2 integer > k. In general, there are K\{K + l)^-^ 
such permutations, for K < N/2. (The first N — K inputs have K +1 output choices while the last K together have 
K\ choices). 

Given a i^-bounded permutation, we consider two problems: 

• What is the optimal depth rearrangeable network that can route /C-bounded inputs to outputs (assuming only 
2x2 switching elements)? It seems evident that the optimal network will have depth 0{logK), but the exact 
constants need to be determined since this will affect control complexity. 

• Can this optimal but restricted rearrangeable network be designed so as to be an efficient building block for a 
network that control-optimally routes every permutation? A trivial (but non-efficient) solution is to have log A'' 
parallel copies of the network of problem 1 for K = 0,2,4,8, .. .. 

We now describe our rearrangeable network for iiT -bounded permutations, the TV-input K-Benes. In the next sec- 
tion, we show that it is optimal. 

Consider any /iT-bounded permutation tt. Divide the set of N input and output lines numbered from 0, 1 ... , TV— 1, 
into N/K bands of K contiguous inputs and outputs each. Each such set of input lines is labeled li, where li = 
[iK, . . . + 1)K-1], < i < N/K-1. Similarly, Oi refers to the z*'* output band. Note that the input and output 
bands refer to the same set of numbers. Also in our notation, when we refer to input i, we are referring to the packet 
(or connection) on input line i, which may be destined to any output. Thus for the given permutation tt, inputs in li 
are destined only to the same or adjacent output bands, i.e., 7r( j) G {Oi^i, Oi, Oi+i}, Vj G li- For example, in a 
4-bounded permutation, (packets on) inputs 0, 1, 2, 3 in Iq are potentially destined to output bands Oq and Oi, i.e., 
output lines through 7. 

For each input band, further define subsets li^jj C /j and lijj C /j, where lijj = {j : Tr{j) G Oi-i} and 
li.D = {j '■ T^U) G Oi+i}. Also define 7^,5 = li — {li.u {JU.d)- Members of Ii,u and lix, represent migrating 
inputs destined to the upper and lower bands respectively, while members of li^s represent stationary inputs, destined 
within the same band. 

Observation 1: \Ii^u\ = and = since the number of migrating inputs into, and out of, a 

band must be equal. 

Figure 3 describes a schematic of the proposed X-Benes rearrangeable network for a given value of K, Q < 
K < N/4 (the X-Benes is identical in performance to the standard Benes for K > N/4). The network consists 
of 2 log K + 2 columns of 2 x 2 switching elements and can be logically divided into three component stages by 
function. Packets in the network first go through a matching stage implemented via a matching network followed by 
a band-exchange stage implemented via shuffle-exchange interconnections, and finally a routing stage where packets 
are routed to their final destinations. 

The matching stage of the K-Benes network consists of N/ K matching networks M, , stacked over each other. Each 
matching network is the equivalent of & K x K Inverse Butterfly network and consists of log K columns of switching 




elements, numbered left to right from through log K — 1. 

The band-exchange stage of the if-Benes consists of two successive columns of switching elements together labeled 
as BE(iV, K). Each column of a BE(iV, K) implements a shuffle-exchange interconnection. The first 'even' band- 
exchange column implements a shuffle-exchange between the outputs of matching network pairs (Afo, Mi), (M2, M3), 
The second 'odd' band-exchange column implements shuffle-exchange interconnections between the outputs of match- 
ing network pairs (Mi , M2), (M3, M4), ... as they come out of the first column. BE(16,2) and BE(16,4) networks are 
also illustrated in Figure 4. 



BE(16,2) BE(16,4) 

Fig. 4 

Band-Exchange networks. 



Finally, the third logical stage of the if-Benes consists of N/ K routing networks Ri stacked over each other, where 
each Ri'i^ & K X K Butterfly. 

The _ftr-Benes network has the following property: 

Property I: The first log K + 1 together with the last log K stages of a regular N x N Benes are isomorphic to a 
JsT-Benes without the 'odd' band-exchange column in the BE(iV, K) network. 



An N X N Benes consists of an x Butterfly foflowed by an x Inverse Butterfly. The property foUows 
from the fact that the first log K +1 columns of an A^ x A^ Butterfly can be made equivalent to stacked K x K inverse 
butterflies followed by the even band-exchange column of BE(A^, K) by relabeling the switching elements in each 
succeeding column p using an ^^--way perfect shuffle, < p < log K — 1. (Please refer to [2] for details). Likewise 
the last log K columns of an A^ x A^ Benes can be made equivalent to stacked K x K Butterflies. 

Figure 5 illustrates this isomorphism between the first 3 stages of a Benes and a K-Benes followed by even band- 
exchange network {K = 4). 




Fig. 5 

Example illustrating isomorphism of Property 1 . 



Thus an A^ X A^ Benes with one extra stage (the odd band-exchange column of BE(A', K)) positioned appropriately, 
contains a K-Benes as an isomorphic subgraph. We use this property later to construct the control-optimal KR-Benes 
network. 

We will now show that the if -Benes is rearrangeable for i^T-bounded permutations. The main intuition behind our 
network design is as follows: Suppose we apply the A^-input Benes control looping algorithm to set the switches in 
the Matching and Routing networks of the if-Benes. By property [2 the K-Benes is an isomorphic subgraph of the 
Benes and we can obtain the switch settings of the matching and routing networks via the isomorphism. 

Now consider an arbitrary input a E li such that 7r(a) = f3, where tt is a /f -bounded permutation. Suppose a 
appears at output line j of Mi after following the above switch settings at each stage, < j < K — 1. If a is a 
migrating up (down) input, then following the switch settings from the j*'* input line of (i?i+i) will lead to 
output p. Similarly, if a is a non-migrating input, then following the switch settings from the j*^ input line of Ri will 
lead to output /?. (We will prove these two statements shortly). In order to prove the rearrangeability of the if-Benes, 
all we have to do is show that a migrating up (down) input a at output line j of Mi is matched by a migrating down 
(up) input at output line j of (M^+i). If this is case, we can set the j*'* switches of the corresponding BE(A^, K) 
networks to the 'cross' state thereby exchanging each migrating input with its 'matched' counterpart. At this point all 
migrating inputs are in the correct bands, at the exact positions they would be in if they had started out in that band 
and can be routed to their destinations by simply following the switch settings of the routing network. 

Consider an A^-input K-Benes, network where the switching elements (SEs) in the Mi and Ri networks are set using 
the A^-input looping algorithm, a £ li, (3, n and j are as defined above. Let bobi . . . biogK-i denote the routing path 



of a, from left to right through the log K stages of Mi, where b„i = (b„i = 1) represents whether a is routed to the 
upper or lower subnetwork in stage m of M^, <m< log i^T — 1 (i.e via the upper (0) or lower (1) output link of its 
SE in stage m). The following lemma formally proves the above statements. 

Lemma 1: If a is a migrating up (down) input that appears at output j of Mi, then following the switch settings 
from the input line of (-Ri+i) will lead to output /3. if a is a non-migrating input, then following the switch 
settings from the input line of Ri will lead to output /3. 

Proof: Since Mi is an inverse butterfly network, we have &o&i • • • ^log a'-i = by definition of the unique path 
property of the inverse butterfly [6]. Without loss of generality, assume a is a down migrating input. By property [l] 
the SEs set by the looping algorithm that lead to output j3 are in Ri+i. Let j be the input of Ri that leads to output (5. 
We now use the fact that the looping algorithm sets switching elements symmetrically, i.e. if an SE for a is set to 
(1) in stage m of Mi, it will be set to (1) in stage logi^ — to of Ri+i. Therefore the established routing path from (3 
to i is identical to the path from a to j. Ri is a butterfly network, however looking at Ri from the output side to the 
input side, it is an inverse butterfly. Therefore, by the routing property of inverse butterfly networks j = j. A similar 
argument holds true when a is an up migrating or non-migrating input. ■ 

We formally define the concept of matching inputs below. 

Definition 1: Matching Inputs: Inputs a G li.o and b £ li+i.u are said to be matching if they have the same 
routing paths (i.e they follow the same sequence of top and bottom SE links) in their respective matching networks. 
Matching is similarly defined for inputs in li jj and Ii-i,D, < i < {N/ K) — 1. 

Thus matching input pairs (if they exist), will be routed to the same output line of their respective matching networks. 
To prove that every migrating input has a matching pair, we use the following lemma. 

Lemma 2: Set the switches in the Matching and Routing networks using the looping algorithm. Consider the inputs 
at an arbitrary stage I of AU, i) <l < logJ-iT-l. Let ifjj and I^jj {ifi) and IPd, resp.) represent inputs in li^u (li^o 
resp.) routed via upper (lower, resp.) output links of the SE in stage I. Then we have. 



\lT.u\ = \iIud\ (1) 

l^fc/l = (2) 

\iId\ = \lli,u\ (3) 

\iEd\ = \lf+i,u\ (4) 

Proof: Please see Appendix. ■ 
Theorem 1: For every migrating input in d and li^jj, < z < ^ — 1, there exists a matching input in li+i^u and 
li-i^D respectively. 

Proof: By the matching lemma, at each stage of the K-Benes, the same number of migrating inputs (of each 
category: up or down) in li have been routed via an upper/lower SE link to the next stage as migrating inputs (of 
the same category) in and U+i. Therefore for each up/down migrating input at the outputs of Mi, there exists a 
down/up migrating input in li^JIi^i which has been routed through the same sequence of upper and lower SE links 
and therefore appears at the same output position in Mi^JMi^i, i.e these are matching inputs. ■ 



This leads us to the main result in this section. 

Theorem 2: The proposed K-Benes network is rearrangeable. 

We summarize the steps in the ii'-Benes permutation routing algorithm below. 

• Route the inputs in each matching network of the ii'-Benes using the control settings obtained by executing the 
A'^-input looping algorithm. 

• Exchange each migrating input in li^u and /j^d with a matching input in Ii-\^D and /i+i,c/, respectively. 

• Route inputs over each routing network using the control settings obtained previously by the looping algorithm. 
Both the K-Benes network architecture and the control algorithm are straightforward. The only 'hard' part of the 

process lies in recognizing that JC-bounded permutations can be routed by dividing the inputs into bands and noting 
that migrating inputs can be matched in adjacent bands when using the Benes control algorithm. 

Figure 6 illustrates permutation routing within a 16 x 16 K-Benes for K = A. The switch settings are obtained 
by considering the first and last two stages of the isomorphic Benes network. In this example, Mq consists of two 
migrating inputs (packets 4 and 5 at input Unes and 2, respectively). Note that packet 5 appears at ouput line 4 of 
Mo as does packet 1 of Mi, as claimed by lenrnia 2. Routing from input Unes 3 of Rq and i?i lead to outputs 1 and 5, 
as claimed in lemma 1 . Some of the matching input Unes and routing paths are drawn boldfaced for clarity. 
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BE(16,4) 



Fig. 6 

K-Benes permutation routing (K = 4, TV = 16). 

IV. Optimality of the K-Benes Network, < i^T < N/A 

The depth of a switching network is the number of columns or stages of switching elements. We will show in 
this section that any rearrangeable network for K-bounded permutations must have a depth at least 2 log K + 2 and 
therefore the i^T-Benes is optimal (For simplicity, we will assume K < N/ 4 in our discussions since the i^-Benes 
reduces to the Benes for K = N/2 and K = N). Also note that we only consider rearrangeable networks based on 



simple 2x2 switching elements, similar to the Benes. We do not consider networks based on comparators or higher 
degree multiplexors and demultiplexors. 

Any rearrangeable network must be able to route permutations containing the following iiT -bounded subpermuta- 
tions: 

1) TTi : TTi (j) = j+K, \fj e li, where i is even. Since tt is X-bounded this implies that all inputs in adjacent bands 
(starting from the Iq , Ii pair) are destined to each other. 

2) ■ T^2{j) ~ j^K, \/j G Ii, where i is even, i ^ {0, [N / K) — 1}. 

3) TTa : [iK, . . . , {i + l)K—l] [iK, . . . , {i + l)K—l] for a given input band It. Thus tts is an ai^biti-ary permutation 
within a given band. 

Define an even band-exchange network as a set of switches for interchanging inputs j and j + K for all such 
inputs in even band pairs {Iq, Ii), {I2, 13), ■ ■ ■■ Likewise define an odd band-exchange network for inputs in band 
pairs (/i, I2), {13,14), ■ ■ .. Henceforth we will use the term 'BE switch' to refer to a single switching element that 
implements an exchange between adjacent bands. Note that the above definition does not restrict all BE switches to 
be together in the same column (they could be dispersed through the rest of the network). However, when all the BE 
switches are together, these networks are identical to the columns of the BE(Af, K) network defined in the last section. 
An even band-exchange network can implement tti while an odd band-exchange network can implement tt2- tts can 
be implemented optimally hy a K x K Benes network. Moreover, none of these three subnetworks can implement 
any of the other two subpermutations. 

Lemma 3: Any rearrangeable network for X-bounded permutations must contain an even band-exchange stage, an 
odd band-exchange stage and a K x K Benes network as subgraphs. 

Proof: Any if-bounded permutation, < if < -j, may contain one or more of the three permutations above 
as subpermutations. Therefore any rearrangeable network must contain an even band-exchange stage, an odd band- 
exchange stage and a K x K Benes network as subgraphs. ■ 

Lemma 4: The optimal rearrangeable network for if-bounded permutations has exactly two band-exchange net- 
works as a subgraph. 

Proof: By symmetry, the optimal network must have the same number of even and odd band-exchange stages. 
From lemma|3] a total of two band-exchange stages is necessary. Four such stages would increase the minimum depth 
of the optimal network to 2 log A' + 3 stages which is more than the K-Benes, a contradiction. Therefore the optimal 
rearrangeable network contains exactly two band-exchange networks as a subgraph. ■ 

Lemma 13 implies that the position of the BE switches within the optimal rearangeable network is critical, since 
there can be only one band-exchange network of either type. Again note that the lemma does not restrict the optimal 
network to having the BE switches for all N/K bands together at the same depth (i.e equivalent to a BE(iV,i4r)). It 
merely states that there can be no more than two total band-exchange connections for each band (equivalently, exactly 
two BE switches at a given line number j, one for exchanging with either adjacent band). However, as we show below, 
BE switches (for any of the N / K bands) cannot all appear 'too early' in terms of their depth in the optimal network. 

Lemma 5: The optimal rearrangeable network contains at least one BE switch after log K stages, i.e stage log K+1 



onwards. 

Proof: We prove this statement by contradiction. Assume the last BE switch for inputs in bands li and /i+i 
occurs at depth \ogK (i.e after \ogK — 1 stages) or earUer. Note that we are considering rearrangeable networks 
consisting only of 2 x 2 switching elements. Any multistage interconnection network (MIN) composed of 2 x 2 
switching elements can be viewed as a union of complete binary trees, with switching elements as nodes. Consider a 
down migrating input at input line a G Ii (i.e 7r(a) G Oi+i). From the standard properties of binary trees, input line a 
in such a network can reach at most K/2 output lines over log K—1 stages. Let OXi be this reachable set for input 
line a. Without loss of generality (WLOG), consider the identical set of lines OXt+i in the next band . Using the 
BE switch under consideration, the input on line a can only be exchanged with one of the K/2 inputs in J^+i that can 
reach set OXi+i. If none of these inputs are up migrating inputs, then this particular exchanged input can never be 
sent back into its destination band O^+i as there are no BE switches after stage log K. Hence this input line will never 
be routed to its destination which contradicts the claim that the network is rearrangeable. Therefore the assumption 
must be incorrect, which proves the statement of the lemma. ■ 
Lemma 6: The optimal rearrangeable network contains at least one BE switch after \ogK + 1 stages, i.e stage 
log if + 2 onwards. 

Proof: The discussion in lemma|5lreferred to bands li and Ii+i. It independently applies to bands li and 
whose last BE switch(es) cannot be in the same stage as the BE switch(es) for bands Ii and li+i. They must be at least 
one stage later and hence the lemma. ■ 

WLOG using lemma|6j let there be a BE switch in stage log if + 2 of the optimal rearrangeable network. 

Lemma 7: The optimal rearrangeable network contains at least log K stages after stage log if + 2. 

Proof: Consider bands li and i,;+i. Figure 7 illustrates a possible routing situation after log if — 1 stages in 
the optimal rearrangeable network. In the figure, a is the solitary down migrating input in li with h the solitary up 
migrating input in i,+i. WLOG, the set of output lines reachable from input line a are the first if/2 lines of Oi (they 
could be any set of if/2 lines but we can assume they are the first if/2 lines for simplicity, without affecting the 
proof). Denote by Xa, the set of input lines (including a) which can reach this output set. Note that \Xa\ = if/2 
because of standard binary tree based MIN properties. The remaining Ya input lines of li reach the last if/2 lines of 
Oi, after log if — 1 stages, \Ya \ = K/2. Note that these Ya lines do not contain any migrating inputs. Similarly, assume 
input line b is positioned in U+i such that only the last if /2 lines of Oi+i are reachable from line b and define Xb and 
Yb similarly, where none of the Y\j lines contain any migrating inputs. 

Assume input line a (input h) can be exchanged with inputs from Yj, {Ya, resp.) in this stage if necessary, i.e there 
are BE switches present in this log if*'* stage. The optimal rearrangeable network algorithm can position a anywhere 
within the first if/2 lines and exchange it with the most suitable input line from Yj,- Let this line be a. Likewise, let 
/3 e Ya be the input that can be exchanged with b in this stage. We will show later that stage log if is the earliest stage 
in which a and a (b and /3, resp.) can be exchanged. 

Now note that if a and a are exchanged in stage log if, the earliest instance a can be exchanged again is stage 
log if +3. Let a and a be exchanged by BE switch j. First, we can assume that the SE in position j of stage log if +1 
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Fig. 7 

Illustration for Lemma^] Dashed lines in the figure indicate destination locations of inputs. Bold lines indicate the 

reachability of output lines from the given input lines. 



is a band-exchange switch for bands and li^i (by symmetry). Next, by lemma 0] the optimal network cannot 
contain another BE switch in position j. Therefore a must move to some other position before being input to another 
BE switch, which requires at least one stage. Therefore a cannot be exchanged with any other input from before 
stage log if +3. 

Now we have four possibilities: 

1) Only a and a are exchanged in stage log K. 

2) Only b and (3 are exchanged in stage log K. 

3) Both pairs are exchanged in stage log K. 

4) None of the pairs are exchanged in stage log K. 

If case 4 is true, assume WLOG that a and b are exchanged with each other. This can be done only in stage log K+1 
or stage log if + 2 (for the symmetric case of a and b in bands i; and ii_i). This is because a and b are present in 
complementary sets of output lines at the input to stage log K (as shown in Figure 7) and one more stage is required 
for them to match up (two more stages for the symmetric case). Once a and b are exchanged (effectively in stage 
logic + 2), they are in the same relative position within their bands. In log if — 1 stages after this point, a and b 
can reach a maximum of K/2 output lines. If a is destined to one set of K/2 output lines and b is destined to the 
complementary set of K/2 lines, then they cannot both be routed to their destinations using only log if — 1 stages. 
In this case, either log if more stages are necessary (which proves the lemma) or the optimal rearrangeable network 
(algorithm) can choose to exchange a with a first (case 1), so that a is now closer to its destination. However, now 
a is in the wrong band and must be exchanged with b or with (3 (case 3) in stage log if + 3 as shown earlier. At least 
log if — 1 more stages are required after this point to route a to its destination, thereby proving the lemma. An analgous 
analysis holds for case 2. 

We now show that the stage log if is the earliest at which a and a can be exchanged (followed by a and b later as 



described above). Suppose a is destined to an output among the top K/2 of output lines in Oi+i. Let all the inputs in 
Yb, except one, be destined to the remaininder of these K/2 output lines. Label this exceptional input a. a is destined 
to the bottom half of the the output lines in Oi+i. b is also destined to the bottom half of output lines in Oi (note 
that this is the necessary condition for case 1 to occur), and therefore if a and b are exchanged they can reach their 
destinations in \ogK—l stages. If any other input line from Yf, is exchanged with a, then that input and b cannot be 
routed to their output hnes in log K— \ stages which will increase the total number of stages in the optimal network to 
2 log K+'d, a contradiction. Now a and a must both arrive at position j in their respective bands to be exchanged. By 
the property of binary tree based MINs it takes exactly log K~ \ stages to achieve this. ■ 
Theorem 3: The ii'-Benes is an optimal rearrangeable network for iiT-bounded permutations. 

Proof: The result follows from lemmas 1-lemma 7. ■ 

V. The KR-Benes: A Control-Optimal Rearrangeable Network 

We now describe how the A'-Benes can be used as a building block to construct the rearrangeable KR-Benes 

network that is control-optimal for all input permutations. The KR-Benes is divided into regular Benes stages and 

Band-Exchange network stages. 

Consider the N x N Benes network with 2 log iV — 1 stages numbered from left to right from 1 onwards. The 

KR-Benes is constructed using the following steps: 

1) Insert Band-Exchange networks BE(A^, 2*) consisting of even and odd band-exchange columns immediately at 
the outputs of each stage i,l < i < log A^— 2 of the Benes (after renumbering them as per the isomorphism with 
the inverse Butterfly illustrated in Figure 5). 

2) We insert two kinds of bypass edges in the network that will allow an input to bypass some stages in the 
underlying network. 

a) First attach bypass edges leading from the outputs of the BE(A^, 2') network directly to the inputs of 
corresponding switching elements in stage 2 log N~i of the Benes (also renumbered as per the isomorphism 
with the inverse Butterfly). 

b) Next attach bypass edges at the output of each Benes stage to the input of the next stage that allow each 
Band-Exchange network itself to be bypassed. 

In a sense, the KR-Benes network consists of multiple planes: The front plane is the N x N Benes network while 
the BE(A^, 2') networks are in a (bypassable) parallel backplane between each of the first log A^— 2 stages of the Benes. 
Figure 8 describes an 8 x 8 KR-Benes. Note that there is only one Band-Exchange network in this case (a BE(8,2) 
after stage 1). 

Theorem 4: An N x N KR-Benes contains every if-Benes network as a subgraph, Q < K < N/ 4. 
Proof: By observation^in section 3, we know that the first and last log K stages of an x Benes together are 
equivalent to a K-Benes without its middle BE(A^, K) subnetwork. Therefore, the first log K stages of the KR-Benes 
(with the first log K~l BE() networks bypassed) followed by the BE(A^, K) network and then the last log K stages of 
the KR-Benes (reached via the bypass lines at the outputs of the BE(A^, K) network) clearly form a K-Benes network. 
Thus the subgraph property is satisfied. ■ 
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Corollary 1: The control complexity of the N x N KR-Benes is niin(A^(2 logK), N{2 logiV— 1)) for any arbitrary 
permutation tt on inputs, where K = max(|7r(z) — Q < i < N — 1 (K is rounded up to the nearest power of 
two). 

Note that the KR-Benes as described above contains 41ogiV — 5 stages (2 logiV — 1 Benes stages and 2(log7V — 2) 
Band-Exchange columns). A simple optimization can reduce the total number of stages in the KR-Benes to 3 log — 3 
(2 log — 1 Benes stages and log iV — 2 Band-Exchange columns). Consider the N x N Benes network constructed 
as a concatenation of N x N inverse butterfly and butterfly networks. Insert the 'odd' band-exchange column from 
BE(A^, 2*) immediately at the output of stage i + 1 of this Benes, 1 < i < logiV — 2. Using the fact that stage i of 
this Benes is isomorphic to the 'even' band-exchange column of BE(A^, 2*) (observation^ and inserting appropriate 
bypass lines, we note that this implementation of the KR-Benes also contains every K-Benes as a subgraph and hence 
is control-optimal. 

Further note that only one of the Band-Exchange networks will ever be used for routing any permutation. Hence 
an alternate implementation of the KR-Benes with 2 log N columns of switching elements can be derived by using a 
single odd Band-Exchange network (simulating different values of K) in the backplane to which outputs from various 
stages of the frontplane NxN Benes are multiplexed. However this scheme will require the use of several multiplexors 
at the inputs and outputs of the Band-Exchange column. 

Every K-Benes network is contained in a KR-Benes and the X-Benes control algorithm is the looping algorithm 
for the isomorphic Benes in its first and last log K stages. Therefore for an arbitrary input permutation, we only need 
to simulate if -Benes routing in the appropriate subgraph of the KR-Benes. The control-algorithm for the KR-Benes 
is as follows: 

• Run the Benes looping algorithm for the first (and last) stage of the KR-Benes. In the process, determine the 



value of K. 

• Use the above value of K to select bypass lines for the first log K — 1 Band-Exchange networks Note that the 
bypass lines can be selected in a self-routing manner after K has been determined while controlUng the first stage. 
Also mark those packets which are migrating inputs. 

• Exchange migrating inputs in the ^¥,{N, K) subnetwork. Since the inputs to be exchanged are marked, the BE(.) 
networks are self-routing. 

• Select bypass Unes at the output of BE(A/^, K) to reach the (2 log N — K)^^ Benes stage of the KR-Benes and 
route the packets to their destinations. 

For example, a K-Benes with K = A can be simulated by bypassing the first BE(A/', 2) network, passing through 
the BE(A?', 4) and then reaching the secondlast stage of the KR-Benes directly via bypass lines. Thus the control 
complexity of the KR-Benes is the same as that of of the corresponding K-Benes plus the constant overhead required 
to set the bypass Unks and BE(2) switches (which are both self-routing). 

The control complexity of the KR-Benes is superior to the Benes on average and its worst case is bounded by the 
Benes whose control complexity is N{2\ogN —1). We can evaluate the performance of the KR-Benes using the 
following metric: There are K\{K -\- 1)^-^ i^-bounded permutations. The KR-Benes control complexity for a 
given /C-bounded permutation is 2N \ogK,Q < K < N/A and N{2\ogN -1), otherwise. Thus the KR-Benes has 
superior control complexity as compared to the Benes for Pjv = (t)'(t + permutations. 

In an environment where all incoming permutations are equally likely, the average control complexity of the KR- 
Benes can be computed as follows: Let Pk represent the number of if-bounded permutations that are not 7^/2- 
bounded. We have = + 1)^"^ - ( f ) ! ( ^ ) ^ . The the average control complexity of the KR-Benes 

is: 

N/i 

1 + {2N^Pk- logK) + iV(21og7V-i)(7V! - Pn) 

K=2 

In many practical cases, the distribution of incoming ii'-bounded permutations to the KR-Benes is likely to be highly 
non-uniform and biased towards smaller K (due to locaUty properties), which enhances the control gain of the KR- 
Benes. Solutions for P2P file sharing, P2P overlay networks and P2P distributed hash tables such as CAN, Viceroy, 
KademUa etc. emphasize locality of access to increase throughput. Permutation networks serving routing needs for 
such networks are Ukely to receive a large number of bounded permutations. Among other examples, a permutation 
network serving as an optical switch [9] in a high-speed LAN environment might typically get most requests to and 
from addresses within the LAN with a few periodically outside the local area. Thus in these cases smaller bounded 
permutations are more likely as inputs, which can be routed more efficiently by the KR-Benes. 

VI. Conclusion 

This paper demonstrates the power and elegance of the Benes for rearrangeably routing permutations. By making 
small modifications to the Benes in order to exploit the locality property of permutations (also called iiT-boundedness), 



we can design a new network with 3 log A'' — 3 (alternately 2 log N) stages, that is control-optimal for arbitrary permu- 
tations. Its control complexity is superior to the Benes in many cases (« ( ) ! ( + 1) t permutations) and bounded 
by the Benes in the worst case. The K-Benes can be used for static traffic scenarios while the KR-Benes network can 
be used for optimally routing /^-bounded permutations, where the value of K changes dynamically. 



References 

[1] V. E. Benes, ''Optimal Rearrangeable Multistage Connecting Networks,"Bell System Technical Journal, 43 (1964), pp. 1641-1656. 
[2] F. T. Leighton. Introduction to Parallel Algorithms and Architectures: Arrays, Trees, Hypercubes. Morgan Kaufmann Publishers, San Mateo, 
CA, 1992. 

[3] David Nassimi, Sartaj Sahni, "A Self-Routing Benes Network and Parallel Permutation Algorithms," IEEE Transactions on Computers, 30(5): 
332-340 (1981) 

[4] S. Keshav. An Engineering Approach to Compter Networking. Addison- Wesley PubUshers, 1997. 

[5] B. Prabhakar and N. McKeown, "On the Speedup Required for Combined Input and Output Queued Switching," Automatica, 35(12):1909- 
1920, 1999. 

[6] J. Duato and L. Ni, "Interconnection Networks", IEEE Computer Society Press, Los Alamitos, CA., 1999. 

[7] David Nassimi and Sartaj Sahni, "Parallel Algorithms to Set Up the Benes Permutation Network," IEEE Transactions on Computers, 31(2): 
148-154 (1982). 

[8] A. Pattavina and G. Maier, "Photonic Rearrangeable Networks with Zero Switching-Element Crosstalk," lEEEINFOCOM 1999, pp. 337-344, 
April 1999. 

[9] Y. Yang, J. Wang and Y. Pan, "Permutation CapabiUty of Optical Multistage Interconnection Networks," /. of Parallel and Dist. Computing, 
Vol. 60, No. 1, pp. 72-91, Jan. 2000. 
[10] A.Waksman, "A Permutation Network," Journal of the ACM, Vol.15, No.l, pp.159-163, Jan. 1968. 

[11] K. Y. Lee, "On the Rearrangeability of (2 log N - I) Stage Permutation Networks, IEEE Trans. Computers, 34(5): 412-425 (1985). 
[12] D.C. Opferman and N.T Tsao-Wu, "On a Class of Rearrangeable Switching Networks, Part I: Control Algorithm", Bell System Technical J., 
Vol.50, pp.1,579-1,600, 1971. 

[13] M. K. Kim, H. Yoon and S. R. Maeng, "Bit-Permute Multistage Interconnection Networks," Microprocessing and Microprogramming, Vol. 
41, pp. 449^68, 1995. 

[14] Y-M. Yeh, T-Y. Feng, "On a Class of Rearrangeable Networks," IEEE Trans. Computers, 41(11): 1361-1379 (1992). 
[15] D. M. Koppelman, A. Y. Oruc, "A Self-Routing Permutation Network," J. Parallel Distrih. Comput, Vol. 10, No. 2, pp. 140-151, (1990). 
[16] C. Jan and Y. Oruc, "Fast Self-Routing Permutation Switching on an Asymptotically Minimum Cost Network," IEEE Trans, on Comput., pp. 
1369-1379, Dec. 1993. 

[17] H. Cam and J. A. B. Fortes, "A Fast VLSI-Efficient Self-Routing Permutation Network," IEEE Trans. Computers, 44(3): 448-453 (1995). 
[18] A. Samsudin and K. Y. Lee, " nD-dBPN: New Self-Routing Permutation Networks Based On the de Bruijn Digraphs," in Intnl. Conf. on 

Parallel Processing, pp. 604-611, 1998. 
[19] E. Lu and S. Q. Zheng, "A Fast Parallel Routing Algorithm for Benes Group Switches," Proc. of the 14th lASTED Intl. Conf on Parallel and 

Distributed Computing and Systems, pp. 67-72, Cambridge, MA, Nov. 2002. 
[20] S-W. Seo, T-Y. Feng and H-l.Lee, "Permutation Realizability and Fault Tolerance Property of the Inside-Out Routing Algorithm," IEEE 

Transactions on Parallel and Distributed Systems, Vol. 10, No. 9: pp. 946-957, 1999. 
[21] T-Y. Feng and S-W. Seo, "A New Routing Algorithm for a Class of Rearrangeable Networks," IEEE Transactions on Computers, Vol. 43, No. 

11: pp. 1270-1280, 1994. 

[22] M. K. Kim, H. Yoon and S. R. Maeng, "On the Correctness of Inside-Out Routing Algorithm," IEEE Transactions on Computers, Vol. 46, 
No. 7, pp. 820-823, 1997. 



Appendix 



17 



Appendix 

A. Preliminary definitions for proving lemma^ 

Permutation Graph: Let n : [0, .., N — 1] — > [0, .., N — 1] denote a permutation on A'^ inputs. We define the permutation graph D 
as a bipartite graph on vertices Vi = [0, . . . , A'^ — 1] and V2 = [0, . . . , — 1] with an edge between i G Vi and j £ V2 if j ~ 
For the A"-Benes network, we group pairs of adjacent vertices in Vi and V2 to logically represent switching elements (SEs). Let Si 
denote the SE of input i. i denotes its companion input to the SE, where i = i — 1 if i is odd, and i + 1 otherwise. Thus, there is 
an edge in H from each switching element 5"; to switching elements ^^(i) and •S'^(-j)- A path in the permutation graph alternately 
traverses switching elements on the input and output side. Fig. 9 illustrates the permutation graph on 8 inputs for the 4-bounded 

Note that the looping algorithm can be viewed as setting SEs to alternate up and 
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down subnetworks while following a path in H. 
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Fig. 9 

Permutation graph for the given permutation. 



Consider a /f-Benes network with K — ^ ^nd a permutation graph 11. Note thate there are only two input bands Jo and Ji. 
Compatibility Graph: We construct a compatibility graph G = {V, E) based on the permutation graph fl as follows: Each edge 
in n between a migrating input in Jo or Ji to an output in Oi or Oo is represented as a vertex in V . V consists of two subsets 
Vi = {(a, n{a))\a G Jo.d} and V2 = {{b, n{b))\b G h,u}. 

There are two kinds of edges between vertices in V. There exists a cross edge labeled i between vertices ui — {a, 7r(a)) G Vi 
and U2 — {b, 7r(6)) G V2 if there is a path of odd length in LI between Sa and S^(^t) 01 between 5*6 and ^^(a) consisting only of 
switching elements in li and Oi, i G {0, 1}. Similarly, there exists a straight edge labeled i between these vertices if a and 6 are 
in the same input band and there is a path of even length in LI between Sa and St or S'.^(a) and ^-^(i,) consisting only of switching 
elements in li and Oi, i G {0, 1}. Figure 10 illustrates the compatibility graph corresponding to the given permutation on 8 inputs. 
The cross edge between vertices (3,6) and (4,1) is labeled since it arises due to a path of length one in Jo. The straight edge 
between (0,4) and (1,5) is labeled 01 since it arises due to a path of length zero in both Jo and Ji. 
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Fig. 10 

COMPTABILITY GRAPH CORRESPONDING TO H. CROSS EDGES ARE SHOWN DASHED. 



A compatibility graph G has the following properties: 

Property 2: If vertices (a, 7r(a)) and (&, 7r(6)) are connected by a cross edge in G, then a and b are both directed to top (bottom) 
subnetworks in the next stage via upper (lower) links of their SEs. 



Proof: Vertices sharing a cross edge are connected by a patli of odd lengtli in 11. The looping algorithm for 11 sets SEs on a 
path to alternate top and bottom subnetworks. ■ 
Similarly, we have 

Property 3: If vertices (a,7r(a)) and {b,-n{b)) are connected by a straight edge in G, then a and b are directed to opposite 
subnetworks in the next stage via opposite links of their SEs. 
Finally, 

Property 4: The number of vertex pairs in Vi and V2 connected by straight edges with same label (i.e or 1) are identical. 
Proof: If a vertex in Vi (V2) is not connected to a vertex in V2 (Vi) by a cross edge labeled (1, resp.), then it must be 
connected to another vertex in Vi (V2) by a straight edge labeled (1, resp.). Hence there must be an identical number of such pairs 
in Vi and V2 . ■ 

B. Proof of lemma^ 

Consider a P x P Benes network implementing a A'-bounded permutation n. Set the switches in the first stage of the Benes 
using the looping algorithm which alternately routes members of each input and output pair to top and bottom subnetworks in the 
next stage. By the isomorphism of the Matching networks of a A'-Benes with the first log A" stages of the Benes, proving the lemma 
is equivalent to proving the stated lemma conditions for the first stage of the above Benes network. 

We prove the matching lemma by induction on P. For the base case, consider P — 2A, i.e there are only two bands Jo and 
Ii. In this case, we need to show that L/Tfjl ~ \Io,d\ ^nd \Ii^u\ ~ \Io,d\- To prove this, consider only edges labeled in the 
compatibility graph for vr. By property|2| all vertices connected by cross edges represent (matching) inputs directed to the same 
top or bottom subnetwork. By properties |3| and |4] all inputs in Vi connected by straight edges labeled have a (matching) input 
in V2 that is directed to the same (top or bottom) subnetwork. Therefore the lemma conditions in the base case of the induction 
hypothesis hold. 

Now consider any A'-bounded permutation on P inputs and assume the inductive hypothesis holds for fewer than P inputs. We 
will need to find an exact matching between inputs in Jo.o and h^u- Assume for the moment that such a matching exists and let 
Vi G Io,D be the matching input for each Xi £ /i.y, 1 < * < Consider the permutation tt on P — K inputs defined as 

follows: 

9{xi) = T^ivi) 1 < i < \h,u\ 

n{a) — 7r(a) for all other inputs a. 

TT is a A'-bounded permutation and hence by the inductive hypothesis, the matching lemma is satisfied for P — K inputs. Inputs 
Xi € /i,i7 are thus assigned top/bottom subnetworks in a manner consistent with all other inputs. Matching inputs yi G /o,d 
can now be assigned the same subnetworks, followed by the remaining inputs in Jq, without conflict. Thus the conditions of the 
matching lemma are satisfied for any set of P inputs and permutation tt. 

To complete the proof, we need to find an exact matching between inputs in Iq^d and I\^u- First construct the compatibility 
graph G on down migrating inputs in Jo and up migrating inputs in Ji using only edges labeled 0. Inputs in /o represented in Vi 
with a cross edge to V2 have found a matching input in /i that is directed to the same subnetwork in the next stage. The remaining 
vertices in Vi and V2 are connected in pairs by straight edges. We need to find an exact matching for the migrating inputs in Jo and 
Ji that are represented in these remaining vertices. 

Consider one such pair {x\,-k{xi)) and (x2, n{x2)) connected by a straight edge in V2 along with {yi, n{yi)) and (1/2, 7r(j/2)) 
connected by a straight edge in Vi. Consider the path of the form Sxi , •S'^(^), ... in permutation graph 11. There are two possible 
cases: 

• Case I: The path is of odd length and terminates in S^fy-^) (or STr(y2))- 

• Case II: The path is of even length and terminates in ■ (including the case X2 = 'xT). 

Figure 6 illustrates these scenarios. If (I) is true, then xi and yi (or j/2) are matching inputs and so are X2 and 1/2 (or yi). If (II) 
is true, then by symmetry the path in 11 from Sy^ , 5^(55-) 1 ■ ■ • onwards will also be of even length and terminate in Sy2 in which 
case xi and X2 can be matched interchangeably with yi and j/2- 

Matches for the remaining migrating inputs in Jo and Ji can be defined similarly. 
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Fig. 11 

Finding matching inputs in IT. 



